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Abstract: This report contains expository notes about a func- 
tion that is popularly known as the Lovasz number of a 
graph G. There are many ways to define and the surpris- 

ing variety of different characterizations indicates in itself that 
^?(G') should be interesting. But the most interesting property 
of 'd{G) is probably the fact that it can be computed efficiently, 
although it lies "sandwiched" between other classic graph num- 
bers whose computation is NP-hard. I have tried to make these 
notes self-contained so that they might serve as an elementary 
introduction to the growing literature on Lovasz's fascinating 
function. 
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The Sandwich Theorem 



It is NP-complete to compute <jj{G), the size of the largest chque in a graph G, and it is 
NP-complete to compute x{G), the minimum number of colors needed to color the vertices 
of G. But Grotschel, Lovasz, and Schrijver proved [5] that we can compute in polynomial 
time a real number that is "sandwiched" between these hard-to-compute integers: 



Lovasz [13] called this a "sandwich theorem." The book [7] develops further facts about the 
function '&{G) and shows that it possesses many interesting properties. Therefore I think 
it's worthwhile to study '(^(G) closely, in hopes of getting acquainted with it and finding 
faster ways to compute it. 

Caution: The function called in [13] is called in [7] and [12]. I am following 
the latter convention because it is more likely to be adopted by other researchers — [7] is a 
classic book that contains complete proofs, while [13] is simply an extended abstract. 

In these notes I am mostly following [7] and [12] with minor simplifications and a few 
additions. I mention several natural problems that I was not able to solve immediately 
although I expect (and fondly hope) that they will be resolved before I get to writing 
this portion of my forthcoming book on Combinatorial Algorithms. I'm grateful to many 
people — especially to Martin Grotschel and Laszlo Lovasz — for their comments on my first 
drafts of this material. 

These notes are in numbered sections, and there is at most one Lemma, Theorem, 
Corollary, or Example in each section. Thus, "Lemma 2" will mean "the lemma in sec- 
tion 2" . 

0. Preliminaries. Let's begin slowly by defining some notational conventions and by 
stating some basic things that will be assumed without proof. All vectors in these notes 
will be regarded as column vectors, indexed either by the vertices of a graph or by integers. 
The notation x > y, when x and y are vectors, will mean that Xy > y^ for all v. If ^ is a 
matrix. Ay will denote column v, and A^y will be the element in row u of column v. The 
zero vector and the zero matrix and zero itself will all be denoted by 0. 

We will use several properties of matrices and vectors of real numbers that are familiar 
to everyone who works with linear algebra but not to everyone who studies graph theory, 
so it seems wise to list them here: 

(i) The dot product of (column) vectors a and h is 



(*) 



a ■ b = a^b ; 



(0.1) 
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the vectors are orthogonal (also called perpendicular) if a • 6 = 0. The length of vector a is 



\\a\\ = Va~^. (0.2) 

Cauchy's inequality asserts that 

a-6< ||a|| (0.3) 

equality holds iff a is a scalar multiple of 6 or 6 = 0. Notice that if A is any matrix we 
have 

n n 
{A A.)y_y = ^ ] (A )ukA.kv — ^ ] ^few^fev = ■ j (0-4) 
fe=l fc=l 

in other words, the elements of A^A represent all dot products of the columns of A. 

(ii) An orthogonal matrix is a square matrix Q such that Q^Q is the identity ma- 
trix /. Thus, by (0.4), Q is orthogonal iff its columns are unit vectors perpendicular to 
each other. The transpose of an orthogonal matrix is orthogonal, because the condition 
Q^Q = I implies that is the inverse of Q, hence QQ^ — I. 

(iii) A given matrix A is symmetric (i.e., A = A'^) iff it can be expressed in the form 

A = QDQ'^ (0.5) 

where Q is orthogonal and D is a diagonal matrix. Notice that (0.5) is equivalent to the 
matrix equation 

AQ = QD, (0.6) 

which is equivalent to the equations 

AQy Qv^V 

for all f , where = D^jy. Hence the diagonal elements of D are the eigenvalues of A and 
the columns of Q are the corresponding eigenvectors. 

Properties (i), (ii), and (iii) are proved in any textbook of linear algebra. We can get 
some practice using these concepts by giving a constructive proof of another well known 
fact: 

Lemma. Given k mutually perpendicular unit vectors, there is an orthogonal matrix 
having these vectors as the first k columns. 

Proof. Suppose first that k = 1 and that x is a d-dimensional vector with ||a;|| = 1. If 
xi = 1 we have X2 = ■ ■ ■ = Xd = 0, so the orthogonal matrix Q = I satisfies the desired 
condition. Otherwise we let 



yi 



A/(l-a;i)/2, yj = -Xj/{2yi) forl<j<d. (0.7) 
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Then 



T II ii2 2 

y y = \\y\\ = 2/1 + 



xl + --- + xl 



1 — Xi 1 — X? 



= 1 



2 ' 2(1 -xi) 
And X is the first column of the Householder [8] matrix 

Q = I-2yy^, (0.8) 

which is easily seen to be orthogonal because 

Now suppose the lemma has been proved for some k > 1; we will show how to increase 
A; by 1. Let Q be an orthogonal matrix and let a; be a unit vector perpendicular to its first 
k columns. We want to construct an orthogonal matrix Q' agreeing with Q in columns 1 
to k and having x in column k + 1. Notice that 



Q^x 





\y/ 



by hypothesis, where there are Os in the first k rows. The {d — /c)-dimensional vector y has 
squared length 

\\y\\^ = Q^x ■ Q^x = x^QQ^x = x'^x = 1 , 

so it is a unit vector. (In particular, y 7^ 0, so we must have k < d.) Using the construction 
above, we can find a {d — k) x {d — k) orthogonal matrix R with y in its first column. Then 
the matrix 

/I \ 


1 

Rj 



Q' = Q 



\ 



does what we want. □ 



1. Orthogonal labelings. Let G be a graph on the vertices V. If u and v are distinct 
elements of V, the notation u — v means that they are adjacent in G; u -/- v means they 
are not. 

An assignment of vectors ay to each vertex v is called an orthogonal labeling of G if 
c^u ' O'v = ^ whenever u -/- v. In other words, whenever a^^ is not perpendicular to ay 
in the labeling, we must have u — v in the graph. The vectors may have any desired 
dimension d; the components of ay are ajy for 1 < j < d. Example: a„ = for all v always 
works trivially. 
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The cost c(a^,) of a vector in an orthogonal labehng is defined to be if a„ = 0, 
otherwise 

( \ __ ^Iv _ ^Iv 

CKflv) — II 119 — 2 I I 2 • 

Notice that we can multiply any vector a„ by a nonzero scalar without changing its 
cost, and without violating the orthogonal labeling property. We can also get rid of a zero 
vector by increasing d hy 1 and adding a new component to each vector, except that 
the zero vector gets the new component 1. In particular, we can if we like assume that all 
vectors have unit length. Then the cost will be of„. 

Lemma. If S QV is a stable set of vertices (i.e., no two vertices of S are adjacent) and 
if a is an orthogonal labeling then 

5^c(a,)<l. (1.1) 

ves 

Proof. We can assume that \\av\\ = 1 for all v. Then the vectors for v ^ S must 
be mutually orthogonal, and Lemma tells us we can find a, d x d orthogonal matrix Q 
with these vectors as its leftmost columns. The sum of the costs will then be at most 

Relation (1.1) makes it possible for us to study stable sets geometrically. 

2. Convex labelings. An assignment x of real numbers Xy to the vertices v of G is 
called a real labeling of G. Several families of such labelings will be of importance to us: 

The characteristic labeling for [/ C 1/ has x„ = | ^ ^ ^ ^' 

lO n V f U. 

A stable labeling is a characteristic labeling for a stable set. 
A clique labeling is a characteristic labeling for a clique (a set of mutually adjacent 
vertices). 

STAB(G) is the smallest convex set containing all stable labelings, 
i.e., STAB(G) = convex hull {a; | a; is a stable labeling of G}. 
QSTAB(G') = { a; > I E^eQ Xy < 1 for aU cliques Q of G } . 
TH(G) = {a; > I J2yev c{ay)xy < 1 for all orthogonal labelings a of G} . 

Lemma. TH is sandwiched between STAB and QSTAB; 

STAB(G) C TH(G) C QSTAB(G) . (2.1) 



Proof. Relation (1.1) tells that every stable labeling belongs to TH(G). Since TH(G) is 
obviously convex, it must contain the convex hull STAB(G). On the other hand, every 
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clique labeling is an orthogonal labeling of dimension 1. Therefore every constraint of 
QSTAB(G') is one of the constraints of TH(G'). □ 

Note: QSTAB first defined by Shannon [18], and the first systematic study of STAB was 
undertaken by Padberg [17]. TH was first defined by Grotschel, Lovasz, and Schrijver in [6]. 

3. Monotonicity. Suppose G and G' are graphs on the same vertex set with G C G' 
(i.e., u — V in G implies u — v 'm G'). Then 

every stable set in G" is stable in G, hence STAB(G') D STAB(G"); 

every chque in G is a clique in G', hence QSTAB(G) D C)STAB(G'); 

every orthogonal labeling of G is an orthogonal labeling of G', 

hence TH(G') D TH(G"). 

In particular, if G is the empty graph on \V\ — n vertices, all sets are stable and all 
cliques have size < 1, hence 

STAB(¥„) = TH(Zn) = QSTAB(Zn) = {x \ < Xy < 1 for all v}, the n-cube. 

If G is the complete graph K^, all stable sets have size < 1 and there is an n-clique, so 

STAB{Kn) = TH(K^) = QSTAB{Kn) = {x > \J2^Xy < 1}, the n-simplex. 

Thus aU the convex sets STAB(G), TH(G), QSTAB(G) lie between the n-simplex and the 
n-cube. 

Consider, for example, the case n = 3. Then there are three coordinates, so we can 
visualize the sets in 3-space (although there aren't many interesting graphs). The QSTAB 

X y z _ 

of • • • is obtained from the unit cube by restricting the coordinates to a; + y < 1 and 

y + 2 < 1; we can think of making two cuts in a piece of cheese: 
001 



100 *^ ^ 010 

The vertices {000, 100, 010, 001, 101} correspond to the stable labelings, so once again we 
have STAB(G) = TH(G') = QSTAB(G). 

4. The theta function. The function "d{G) mentioned in the introduction is a special 
case of a two-parameter function ^{G, w), where w is a, nonnegative real labeling: 

'd{G,w) = max{w ■ x\ X em{G)}; (4.1) 

i&{G) = i&{G, 1) where 1 is the labeling Wy = 1 for all v. (4.2) 
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This function, called the Lovdsz number of G (or the weighted Lovdsz number when w ^ 1), 
tells us about 1-dimensional projections of the n-dimensional convex set TH(G). 

Notice, for example, that the monotonicity properties of §3 tell us 

GCG' ^ ^{G,w)>^{G',w) (4.3) 
for all w > 0. It is also obvious that i? is monotone in its other parameter: 

w<w' ^ d{G, w) < ^{G, w') . (4.4) 
The smallest possible value of § is 

{){Kn,w) ^m.?OL{wi, . . . ,Wn]; d{Kn) = I ■ (4.5) 

The largest possible value is 

d{Kri,w)=Wi + --- + Wn-, 1»(Kn)=n. (4.6) 

Similar definitions can be given for STAB and QSTAB: 

a{G, w) = max{ wx\x e STAB(G) } , a{G) = a{G, 1) ; (4.7) 
/i(G,w) =max{«;-x I X e QSTAB(G)}, k{G) ^ k{G,1) . (4.8) 

Clearly a{G) is the size of the largest stable set in G, because every stable labeling x 
corresponds to a stable set with 1 • x vertices. It is also easy to see that k,{G) is at most 
x(G), the smallest number of cliques that cover the vertices of G. For if the vertices can 
be partitioned into k cliques Qi, ■ ■ .,Qk and if a; e QSTAB(G), we have 

1 ■ X — Xy + ■ ■ ■ + Xy <k . 

veQi veQk 

Sometimes k,{G) is less than x(G). For example, consider the cyclic graph C^, with 
vertices {0, 1, ... ,n — 1} and u — v iS u ^ v ± 1 (mod 1). Adding up the inequalities 
xq + xi < 1, . . . , Xn-2 + Xn-i < 1, Xn-i + xq < 1 of QSTAB gives 2{xo + ■ ■ ■ + Xn-l) < 
and this upper bound is achieved when all x's are i; hence K,{Cn) = ^, if n > 3. But x(G) 
is always an integer, and x(C„) = is greater than K{Cn) when n is odd. 

Incidentally, these remarks establish the "sandwich inequality" (*) stated in the in- 
troduction, because 

a{G) < ^{G) < k{G) < x{G) (4.9) 
anda;(G) = a(G),x(G') = x(G). 
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5. Alternative definitions of -d. Four additional functions i^i, 1^2, 1^3, 1^4 are defined 
in [7], and they all turn out to be identical to -d. Thus, we can understand in many- 
different ways; this may help us compute it. 

We will show, following [7] , that if w is any fixed nonnegative real labeling of G, the 
inequalities 

^{G, w) < ^i{G, w) < ^2{G, w) < ^3{G, w) < MG, w) < ^{G, w) (5.1) 

can be proved. Thus we will establish the theorem of [7] , and all inequalities in our proofs 
will turn out to be equalities. We will introduce the alternative definitions -dk one at a 
time; any one of these definitions could have been taken as the starting point. First, 

'di(G,w) = min m.8Lx(wy/c(ay)) , over all orthogonal labelings a. (5-2) 

a V 

Here we regard Wy/c^ay) — when — c(a^) = 0; but the max is 00 if some > has 
c(a^) = 0. 

Lemma. d{G,w) <'di{G,w). 

Proof. Suppose x G TH(G) maximizes w ■ x, and suppose a is an orthogonal labeling that 
achieves the minimum value '&i{G,w). Then 



't?(G, w) = w ■ X 



WyXy < ( max I c(ay)xy < max = -di (G, w) . □ 

^ V ^ c{ay)J ^ V c{ay) 



Incidentally, the fact that all inequalities are exact will imply later that every nonzero 
weight vector w has an orthogonal labeling a such that 

v{G, w) 

We will restate such consequences of (5.1) later, but it may be helpful to keep that future 
goal in mind. 

6. Characterization via eigenvalues. The second variant of is rather different; this 
is the only one Lovasz chose to mention in [13]. 

We say that A is a. feasible matrix for G and it; if A is indexed by vertices and 

A is real and symmetric; 

Ayy — Wy Sll V ^ V] 

Auv = \/wuWy whenever u -/- v va. G (6-1) 
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The other elements of A are unconstrained (i.e., they can be anything between — oo and 
+oo). 

If A is any real, symmetric matrix, let A(^) be its maximum eigenvalue. This is well 
defined because all eigenvalues of A are real. Suppose A has eigenvalues {Ai, . . . , An}; then 
A = (5diag(Ai, . . . , Xn)Q^ for some orthogonal Q, and ||Qa;|| = ||a;|| for all vectors x, so 
there is a nice way to characterize A(^): 

A(A) = max{ x^Ax | = (6.2) 

Notice that A(^) might not be the largest eigenvalue in absolute value. We now let 

'&2{G, w) = min{ A{A) | ^ is a feasible matrix for G and w } . (6.3) 

Lemma. '&i{G,w) < ■d2{G,w). 

Proof. Note first that the trace tr^ = Ylv'^v > for any feasible matrix A. The trace 
is also well-known to be the sum of the eigenvalues; this fact is an easy consequence of the 
identity 

m n 

tvXY = Y,Y. ^JkVkj = trYX (6.4) 
j=i k=i 

valid for any matrices X and Y of respective sizes mxn and nxm. In particular, '§2 {G, w) 
is always > 0, and it is = if and only if = (when also 'di{G^ tu) = O). 

So suppose w ^ Q and let ^ be a feasible matrix that attains the minimum value 
A(^) = ^2{G,w) = A > 0. Let 

B = \I-A. (6.5) 

The eigenvalues of B are A minus the eigenvalues of A. (For \i A = Q diag(Ai, . . . , \n)Q^ 
then B — Q diag(A — Ai, . . . , A — \n)Q^ ^ Thus they are all nonnegative; such a matrix B 
is called positive semidefinite. By (0.5) we can write 

B = X'^X, i.e., Buv = Xu-Xy, (6.6) 

when X = diag(yA — Ai, . . . , ^/X — Xn ■ 

Let tty = {^/w^ ,xiv, . . .,XrvY- Then c{av) = w^/ ||a„|p = Wy/{wy + a;f „ H \- x'^^) 

and xf^ + ■ ■ ■ + x'^^ = Byy = A — hence c{ay) = Wy/X. Also if u -/- v we have 
du ■ Civ = ^/WyWy Xu ■ Xy = ^/ w uW y + B ^y = ^/ w yW y — A^y — 0. Therefore a is an 
orthogonal labeling and max^; Wy/c{ay) — X> 'di{G,w). □ 

7. A complementary characterization. Still another variation is based on orthogonal 
labelings of the complementary graph G. 
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In this case we let b be an orthogonal labeling of G, normalized so that WbvW^ = 1) 
and we let 




6 is a normalized orthogonal labeling of G j . (7.1) 

A normalized orthogonal labeling b is equivalent to an n x n symmetric positive semidefinite 
matrix B, where B^y = bu ■ by is zero when u — v, and where ti B = 1. 

Lemma. ■d2{G,w) < ■{}^{G,w). 

This lemma is the "heart" of the proof that all -i^s are equivalent, according to [7]. It 
relies on a fact about positive semidefinite matrices that we will prove in §9. 

Fact. If A is a symmetric matrix such that A ■ B > for all symmetric positive semi- 
definite B with Buy = for u — v, then A — X + Y where X is symmetric positive 
semidefinite and Y is symmetric and Yyy — for all v and Yuv = for u -/- v. 

Here C B stands for the dot product of matrices, i.e., the sum J^u v ^uvBuv, which can 
also be written iiC'^B. The stated fact is a duality principle for quadratic programming. 

Assuming the Fact, let W be the matrix with W^v — \/wuWy, and let ■i^s = '^■^{G, w). 
By definition (7.1), if b is any nonzero orthogonal labeling of G (not necessarily normalized), 
we have 

U,V V 

In matrix terms this says W ■ B < (t^sI) • B for all symmetric positive semidefinite B with 
Buy = for tt — V. The Fact now tells us we can write 

■dsl -W = X + Y (7.3) 

where X is symmetric positive semidefinite, Y is symmetric and diagonally zero, and 
Y^y = when u -/- v. Therefore the matrix A defined by 

A = W + Y = 'd3l - X 

is a feasible matrix for G, and A(^) < i^^. This completes the proof that ■d2{G,w) < 
■&3{G,w), because A(^) is an upper bound on 'i?2 by definition of 'i?2- □ 
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8. Elementary facts about cones. A cone in A'"-dimensional space is a set of vectors 
closed under addition and under multiplication by nonnegative scalars. (In particular, it 
is convex: If c and c' are in cone C and Q < t < 1 then tc and (1 — t)c' are in C, hence 
tc+ {1 — t)c' e C.) A closed cone is a cone that is also closed under taking limits. 

Fl. If C is a closed convex set and x ^ C, there is a hyperplane separating x from C. 
This means there is a vector y and a number b such that c ■ y < 6 for all c e C but x-y > b. 

Proof. Let d be the greatest lower bound of \\x — c|p for all c G C Then there's a 
sequence of vectors Ck with — Cfc|p < d+l/k; this infinite set of vectors contained in the 
sphere {y \ \\x — yW^ < d + 1} must have a limit point Cqo, and c^o ^ C since C is closed. 
Therefore \\x — CoolP > d; in fact ||a: — Coo|P = d, since \\x — Coo|| < \\x — Ck\\ + \\ck — Coo|| 
and the right-hand side can be made arbitrarily close to d. Since x ^ C, we must have 
d > 0. Now let y = X — Coo and h = Coo • y. Clearly x-y = y- y + b>b. And if c is any 
element of C and e is any small positive number, the vector ec + (1 — e)coo is in C, hence 
\\x — (ec + (1 — e)coo) II > d. But 

||a; - [ec + (1 - e)coo)||^ - d = \\x - Coo - e(c - Coo)\\^ - d 

= -2ey ■ {c- Coo) + ||c - Coo||^ 

can be nonnegative for all small e only if y • (c — Cqo) < 0, i.e., c - y < b. □ 

If A is any set of vectors, let A* = { 6 | a ■ 6 > for all a E A}. 
The following facts are immediate: 

F2. AC A' implies A* D A'*. 

F3. A C A**. 

F4. A* is a closed cone. 

From Fl we also get a result which, in the special case that C = { Ax | a; > } for a 
matrix A, is called Farkas's Lemma: 

F5. If C is a closed cone, C = C**. 

Proof. Suppose x G C** and x ^ C, and let {y,b) be a separating hyperplane as in Fl. 
Then {y, 0) is also a separating hyperplane; for we have x ■ y > b > because G C, and 
we cannot have c ■ y > for c G C because (Ac) ■ y would then be unbounded. But then 
c • {—y) > for all c G C, so —y G C*; hence x ■ {—y) > 0, a contradiction. □ 

If A and B are sets of vectors, we define A + B = {a + b\ a E A and b E B}. 



The Electronic Journal of Combinatorics 1 (1994), # Al 



12 



F6. If C and C are closed cones, (C n C")* = C* + C* . 

Proof. If A and B are arbitrary sets we have A* + B* C (An B)*, for if x e A* + B* 
and y G A n S then x-y = a- y + b- y>0. If A and S are arbitrary sets including then 
{A + B)* C A* nB* by F2, because A + B D A and A + B D B. Thus for arbitrary A 
and B we have (A* + S*)* C n B** , hence 

Now let ^ and 5 be closed cones; apply F5 to get A* + B* D {An B)* . □ 

F7. If C and C are closed cones, {C + C")* = C* n C*. (I don't need this but I might 
as well state it.) Proof. F6 says (C* n C"*)* = C** + C"**; apply F5 and * again. □ 

F8. Let S be any set of indices and let ^5 = { a | = for all s e S }, and let S be all 
the indices not in S. Then 

A*s = A^. 



Proof. If 6s = for all s ^ 5" and = for all s & S, obviously a ■ 6 = 0; so A-g C A*g. 
If 6s 7^ for some s ^ S and ot = for alH s and = —bs then a e As and a • 6 < 0; 
so 6 ^ A*s, hence % 3 A* . □ 

9. Definite proof of a semidefinite fact. Now we are almost ready to prove the result 
needed in the proof of Lemma 7. 

Let D be the set of real symmetric positive semidefinite matrices (called "spuds" 
henceforth for brevity), considered as vectors in A'"-dimensional space where N = |(n+l)n. 
We use the inner product A-B = tr A'^B; this is justified if we divide off-diagonal elements 
by -\/2. For example, if n = 3 the correspondence between 6-dimensional vectors and 3x3 
symmetric matrices is 



a 



d/V2 e/V2\ 



d/V2 6 f/V2 
\e/V2 f/V2 c J 



{a,h,c,d,ej) 

preserving sum, scalar product, and dot product. Clearly D is a closed cone. 
F9. D* = D. 

Proof. If A and B are spuds then A = X^X and B = Y^Y and yl • S = tr X'^X Y^Y = 
tiXY'^YX'^ = (yX^) • (YX^) > 0; hence D C D*. (In fact, this argument shows that 
^ • S = iff AB = 0, for any spuds A and B, since A = A'^.) 
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If A is symmetric but has a negative eigenvalue A we can write 

A = gdiag(A,A2,...,An) 

for some orthogonal matrix Q. Let B = Q diag (1,0,..., 0)Q^; then 5 is a spud, and 

A-B = tT A^B = tr Q diag (A, 0, . . . , 0) = A < . 

So A is not in D*] this proves D D D*. □ 

Let E be the set of all real symmetric matrices such that E^v = when u — v in a 
graph G; let F be the set of all real symmetric matrices such that Fuy = when u = v or 
u -/- V. The Fact stated in Section 7 is now equivalent in our new notation to 

Fact. {DnE)*CD + F. 
But we know that 

{DnE)*=D*+E* byF6 

= D + F by F9 and F8. □ 

10. Another characterization. Remember t^i, ■§2, and i^s? We are now going to 
introduce yet another function 



'!?4(G, w) = max < c{by)wy 



b is an orthogonal labeling of G > . (10-1) 



Lemma. -d^iG^w) < ■d^{G,w). 



Proof. Suppose 6 is a normalized orthogonal labeling of G that achieves the maximum 1^3; 
and suppose the vectors of this labeling have dimension d. Let 



= ^kvVw^, for 1 < /c < d ; (10.2) 



V 



then 

T^siG^w) = ^Vw^by, ■ by^/w^^ Y VwuWy bkuhv = X^a^fe • 

u,v u,v,k k 

Let Q be an orthogonal d x d matrix whose first row is (iCi/y^, . . . , Xd/ V^)'^ 1 and let 
by = Qby. Then b'y^ ■ by = b^Q^Qby = by by = bu ■ by, so b' is a normalized orthogonal 
labeling of G. Also 



x'k = Y KvVw^ = Y QkjbjvVw^ 



v,3 



' , k> 1. 



3 
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Hence by Cauchy's inequality 

= J2ciK)'^v<MG,'^) (10.4) 

V 

because EJI^J; IP = EJI^^^ IP = 1- □ 

11. The final fink. Now we can close the loop: 
Lemma. 'd4^{G,w) < i){G,w). 

Proof. If h is an orthogonal labeling of G that achieves the maximum 'i?4, we will show 
that the real labeling x defined by Xy = c{by) is in TH(G). Therefore d^{G^w) = w ■ x is 
< ^{G,w). 

We will prove that if a is any orthogonal labeling of G, and if b is any orthogonal 
labeling of G, then 

J2<ay)c{by)<l. (11.1) 

V 

Suppose a is a labeling of dimension d and b is of dimension d'. Then consider the d x d' 
matrices 

Ay = aybl (11.2) 
as elements of a vector space of dimension dd' . li u ^ v we have 

Au-Ay = tr AlA^ = tr b^a^a^b^ = tr a^a^b^b^ = , (11.3) 

because a^a„ = when u -/- v and byb^ = when u — v. If u = v we have 

Ay • Ay = ||CI^|| ll^'vll . 

The upper left corner element of Ay is ai„6i„, hence the "cost" of Ay is (ai„6i„)^/ ||^.y|P = 
c{ay)c{by). This, with (11.3), proves (11-1). (See the proof of Lemma 1.) □ 

12. The main theorem. Lemmas 5, 6, 7, 10, and 11 establish the five inequalities 
claimed in (5.1); hence all five variants of are the same function of G and w. Moreover, 
all the inequalities in those five proofs are equalities (with the exception of (11.1)). We 
can summarize the results as follows. 



The Electronic Journal of Combinatorics 1 (1994), # Al 15 

Theorem. For all graphs G and any nonnegative real labeling w of G we have 

^{G,w) = MG,'^) ^ MG,w) ^ ^3{G,w) ^ MG,w) . (12.1) 

Moreover, if w 0, there exist orthogonal labelings a and b of G and G, respectively, such 
that 

c{ay) = Wy/^; (12.2) 
J2<ay)c{by) = l. (12.3) 

Proof. Relation (12.1) is, of course, (5.1); and (12.2) is (5.3). The desired labeling b is 
what we called b' in the proof of Lemma 10. The fact that the application of Cauchy's 
inequality in (10.4) is actually an equality, 

tells us that the vectors whose dot product has been squared are proportional: There is a 
number t such that 



\\by\\=t ^^^f^ , if by ^0; \\bv\\ = iff 6i„V^=0. (12.5) 
The labeling in the proof of Lemma 10 also satisfies 

E ll^-f = 1; (12-6) 

V 

hence t = ±1 / . 
We can now show 

c{by) = \\byf i}/wy , whcu tu^ 7^ . (12.7) 
This relation is obvious if = 0; otherwise we have 

117, ||2 

c{b.) = ^ = §^ 

\\by\\^ t^Wy 

by (12.5). Summing the product of (12.2) and (12.7) over v gives (12.3). □ 

13. The main converse. The nice thing about Theorem 12 is that conditions (12.2) 
and (12.3) also provide a certificate that a given value is the minimum or maximum 
stated in the definitions of i^i, 'i?2, f^s, and 'i?4. 
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Theorem. If a is an orthogonal labeling of G and b is an orthogonal labeling of G such 
that relations (12.2) and (12.3) hold for some and w, then is the value of '&{G, w). 

Proof. Plugging (12.2) into (12.3) gives ^ Wyc{by) = hence 'd < '&4{G, w) by definition 
of i?4. Also, 

max — — - — v , 
hence > 'di{G, w) by definition oidi. □ 

14. Another look at TH. We originally defined "OiG, w) in (4.1) in terms of the convex 
set TH defined in section 2: 

t9{G,w) =max{wx\x eTE{G)}, when > . (14.1) 

We can also go the other way, defining TH in terms of 

TH(G') ^{x>0\w-x< 'd{G, w) for aU w > } . (14.2) 

Every x e TH(G) belongs to the right-hand set, by (14.1). Conversely, if x belongs to the 
right-hand set and if a is any orthogonal labeling of G, not entirely zero, let Wy = c(a„), 
so that w ■ X = c{ay)xy. Then 

'di(G,w) < mBi:x(wy/c{ay)) = 1 

V 

by definition (5.2), so we know by Lemma 5 that '^c{ay)xy < 1. This proves that x 
belongs to TH(G'). 

Theorem 12 tells us even more. 
Lemma. TH(G) = { a; > | i&(G, x)<l}. 
Proof. By definition (10.1), 

'i?4(G, w) = max ^ ^ c{ay)wy | a is an orthogonal labeling of G | . (14-3) 
Thus X e TH(G') iff i&4(G, x) < 1, when a; > 0. □ 

Theorem. TH(G) — {x \ Xy — c{by) for some orthogonal labeling b of G}. 

Proof. We already proved in (11.1) that the right side is contained in the left. 

Let X e TH(G') and let i9 = t9{G,x). By the lemma, ^<1. Therefore, by (12.2), there 
is an orthogonal labeling b of G such that c{by) = Xy/i9 > Xy for all v. It is easy to reduce 
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the cost of any vector in an orthogonal labehng to any desired value, simply by increasing 
the dimension and giving this vector an appropriate nonzero value in the new component 
while all other vectors remain zero there. The dot products are unchanged, so the new 
labeling is still orthogonal. Repeating this construction for each v produces a labeling with 

C{by) = Xy. □ 

This theorem makes the definition of 'd^ in (10.1) identical to the definition of 'd 
in (4.1). 

15. Zero weights. Our next result shows that when a weight is zero, the corresponding 
vertex might as well be absent from the graph. 

Lemma. Let U be a subset of the vertices Vofa graph G, and let G' = G\U be the graph 
induced by U (i.e., the graph on vertices U with u — v in G' iffu — v in G). Then if w 
and w' are nonnegative labehngs of G and G' such that 

Wy = Wy when v & U , Wy = when v ^ U , (1^.1) 

we have 

^{G,w) =^{G',w') . (15.2) 

Proof. Let a and b satisfy (12.2) and (12.3) for G and w. Then c{ay) = ior v ^ U, so 
a\U and b\U satisfy (12.2) and (12.3) for G' and w'. (Here a\U means the vectors ay for 
V E U.) By Theorem 13, they determine the same 'd. □ 

16. Nonzero weights. We can also get some insight into the significance of nonzero 
weights by "splitting" vertices instead of removing them. 

Lemma. Let v be a vertex of G and let G' be a graph obtained from G by adding a new 
vertex v' and new edges 

u — v' iff u — V. (16.1) 
Let w and w' be nonnegative labehngs of G and G' such that 

Wu = w'^, when u ^ v\ (16.2) 

Wy = w'y + w'y, . (16.3) 

Then 

^{G,w)^^{G',w'). (16.4) 

Proof. By Theorem 12 there are labehngs a and b of G and G satisfying (12.2) and 
(12.3). We can modify them to obtain labehngs a' and b' of G' and G' as follows, with the 
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vectors of a' having one more component than the vectors of a: 

a'u = ' K = K, when u v; (16.5) 



Civ\ I I O^v \ / W^' II II a I ^( 



-.= \^)^ a,,= ^_^^, a=y-j||a.||, /?=y-fK||; (16.6) 

h'^ = b'^, = by . (16.7) 

(We can assume by Lemma 15 that w'y and w'^, are nonzero.) All orthogonality relations 
are preserved; and since v -/- v' in G', we also need to verify 

a'y ■ a'y, = ||a„|p — aP = . 

We have 

, c{ay) \\av\\^ _ c{ay) _ c{ay)w'^ _ 

||a,||2 + a2 - l + w'^,/w', ~ Wy ~ ^ ' 

and similarly c{a'^,) = w'y,/i9; thus (12.2) and (12.3) are satisfied by a' and b' for G' and w'. 
□ 

Notice that if all the weights are integers we can apply this lemma repeatedly to 
establish that 

i}{G,w) =i}iG') , (16.8) 

where G' is obtained from G by replacing each vertex by a cluster of Wy mutually 
nonadjacent vertices that are adjacent to each of f 's neighbors. (Recall that '&{G') — 
t?(G', 1), by definition (4.2).) In particular, if G is the trivial graph K2 and if we assign 
the weights M and A^, we have 'd[K2, (M^N)'^^ — '&{Km,n) where Km,n denotes the 
complete bipartite graph on M and N vertices. 

A similar operation called "duplicating" a vertex has a similarly simple effect: 

Corollary. Let G' he constructed from G as in the lemma but with an additional edge 
between v and v' . Then ■d{G,w) = d{G\w') if w' is defined by (16.2) and 

Wy = max(w^, w'^,) . (16.9) 

Proof. We may assume that Wy — w'^ and 10'^, ^ 0. Most of the construction (16.5)~ 
(16.7) can be used again, but we set a = and b'^, = and 
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Once again the necessary and sufficient conditions are readily verified. □ 

If the corollary is applied repeatedly, it tells us that is unchanged when we 

replace the vertices of G by cliques. 

17. Simple examples. We observed in section 4 that ■&{G,w) always is at least 

T^min = 'd{Kn, w) = max{wi, . . . , Wn} (17.1) 

and at most 

"i^max = {Kn, w) = Wi -\ \- Wn ■ (17.2) 

What are the corresponding orthogonal labelings? 

For Kn the vectors of a have no orthogonal constraints, while the vectors of b must 
satisfy bu ■ by — for all u ^ v. We can let a be the two-dimensional labeling 



«^ = ( ) ' ^ = ^min (17.3) 



SO that WayW^ = and c{ay) = Wy/d as desired; and b can be one-dimensional, 

(1), ift; = t;max 
(0), if^^ 



(17.4) 



where Vmax is any particular vertex that maximizes Wy. Clearly 

Y,<ay)c{by) = ^ !^ = 1. 



For Kn the vectors of a must be mutually orthogonal while the vectors of b are 
unrestricted. We can let the vectors a be the columns of any orthogonal matrix whose top 
row contains the element 

V^V^, ^ = l?max (17.5) 

in column v. Then ||ai,|p = 1 and c(a„) = Wyf-d. Once again a one-dimensional labeling 
suffices for b; we can let by = (1) for all v. 

18. The direct sum of graphs. Let G = G' + G" be the graph on vertices 

V = V'UV" (18.1) 

where the vertex sets V and V" of G' and G" are disjoint, and where u — in G if and 
only if u, f G V and u — f in G', or u, f G V" and u — f in G" . In this case 

i?(G, w) = ^{G', w') + ^{G", w") , (18.2) 
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where w' and w" are the sublabehngs of w on vertices of V' and V" . We can prove (18.2) 
by constructing orthogonal labehngs (a, 6) satisfying (12.2) and (12.3). 

Suppose a! is an orthogonal labeling of G' such that 

\\a'X = ^' a;, = V<, (18.3) 

and suppose a" is a similar orthogonal labeling of G" . If a' has dimension d' and a" has 
dimension d" , we construct a new labeling a of dimension d = d' + d" as follows, where j' 
runs from 2 to and j" runs from 2 to d": 

iiveV if V e 

«i« = \A< = a'l^ : ait; = ^ '^'iv ' 

Oi'« = vW^?^«i'^ , aj'v = , (18.4) 

0'{d'+j")v = , a(^d'+j")v = yfOW' (^'jn^ . 

Now if u, f G V we have 

^ ^" ^ 



thus • = when a!^ - o!^= 0, and 



|a.f = ^||a;f (18.6) 



It follows that cioiy) = Wy/d as desired. A similar derivation holds for u,v E V". And if 
ueV',v e V", then 

au ■ ay = ^/^^ - ^/w{/w^ = . (18.7) 



The orthogonal labeling 6 of G' + G" is much simpler; we just let 6„ = by for v ^ V 
and by = 6:^' for d e V. Then (12.2) and (12.3) are clearly preserved. This proves (18.2). 

There is a close relation between the construction (18.4) and the construction (16.6), 
suggesting that we might be able to define another operation on graphs that generalizes 
both the splitting and direct sum operation. 

19. The direct cosum of graphs. If G' and G" are graphs on disjoint vertex sets V 
and V" as in section 18, we can also define 
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This means u — v in G if and only if either u — v in G' or u — v in G" or u and v 
belong to opposite vertex sets. In this case 

^{G, w) = max(i?(G", w'), ^{G" , w")) (19.2) 

and again there is an easy way to construct (a, b) from (a', b') and (a", b") to prove (19.2). 
Assume "without lots of generality" that 

^{G',w')>^{G",w") (19.3) 

and suppose again that we have (18.3) and its counterpart for a". Then we can define 

iiv eV iiv e V" 

aiv = "/uv = a'lv ) cbiv = Vwv" = a'l^ , 

aj'v = a'j,^ , aj'y = , (19.4) 

Oid'+Dv = 0, a^d'+i)v = v/(^'-^"KV^", 

Now is essentially unchanged when v ^V'\ and when u,v & V" we have 

a„-a, = V« + %K- < - V« ) = ^<-<- (19-5) 

Again we retain the necessary orthogonality, and we have c(a„) = w^/'d for all v. 
For the 6's, we let by = 6^ when v &V' and = when v e F". 

20. A direct product of graphs. Now let G' and G" be graphs on vertices V and V" 
and let F be the n = n'n" ordered pairs 

V xV" . (20.1) 

We define the 'strong product', 

G = G' * G" (20.2) 

on V by the rule 

(«', u") — {v', v") or («', «") = {v', v") in G 

^ {u' — v' or u' = v' in G') and {u" — v" or u" = v" in G") . (20.3) 

In this case we have, for example, Kn' * Kn" = Kn'n" and Kn' * i^n" = K^'n"- 
More generally, if G' is regular of degree r' and is regular of degree r", then G' * G" 
is regular of degree (r' + l)(r" + 1) — 1 = r'r" + r' + r" . 
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I don't know the value of '&{G, w) for arbitrary w, but I do know it in the special case 

W(v',v") = w'y,w'l„ . (20.4) 
Lemma. If G and w are given by (20.2) and (20.4), then 

^{G, w) = ^{G', w') ^{G", w") . (20.5) 

Proof. [12] Given orthogonal labelings (a', b') and (a", b") of G' and G", we let a be the 
Hadamard product 

0'ij',j"){v',v") = a'j'v'0'j"v" ^ ^<j'<d', l<j"<d", (20.6) 
where d' and d" are the respective dimensions of the vectors in a' and a". Then 

'^{u',u") ' '^{v',v") — a'j,y^,aj„y^„a'j,y,aj„^„ 

= {a'^, ■ a'^,){a';„ ■ a':.) . (20.7) 

Thus = ||a^/|p and 

c(a(„'i,")) = c{a'^,)c{a'^„) . (20.8) 

The same construction is used for b in terms of b' and b". 

All necessary orthogonalities are preserved, because we have 

{u\u") — {v',v") and {u',u") {v',v") in G 

(u' — v' and u' v' in G') or {y!' — v" and u" v" in G") 
=^ b{u',u") ■ b^y'yi) = ; 
iu\u") -i- {y',v") and («',«") (v' ,v") in G 

(u' -i- v' and u' v' in G') or («" ^ v" and v" in G") 

(In fact one of these relations is <J^, but we need only =^ to make (20.7) zero when it needs 
to be zero.) Therefore a and b are orthogonal labelings of G that satisfy (12.2) and (12.3). 
□ 

21. A direct coproduct of graphs. Guess what? We also define 
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This graph tends to be "richer" than G' * G"; 



we have 



(u\ u") — {v\ v") and {u\ u") ^ {v' , v") in G 

^ {u' — v' and u' ^ v' in G') or {u" — v" and u" ^ v" in G") . (21.2) 



Now, for instance, if G' is regular of degree r' and G" is regular of degree r", then 

G' * G" is regular of degree n'n" — {n' — r'){n" — r") = r'n" + r"n' — r'r" . 

(This is always > r'r" + r' + r", because r'(n" - 1 - r") + r"(n' - 1 - r') > 0.) Indeed, 
G' * G" D G' * G" for all graphs G' and G" . The Hadamard product construction used 
in section 20 can be applied word-for-word to prove that 



when G satisfies (21.1) and w has the special factored form (20.4). 
It follows that many graphs have identical i^'s: 

Corollary. If G' * G" C G C G' * G" and w satisBes (20.4), then (21.3) holds. 

Proof. This is just the monotonicity relation (4.3). The reason it works is that we have 
(^{u',v') ■ (^{u",v") = ^(u'.v') ■ b(u",v") = for all pairs of vertices {u',u") and {v',v") whose 
adjacency differs in G' * G" and G' * G" . □ 

Some small examples will help clarify the results of the past few sections. Let P3 be 
the path of length 2 on 3 vertices, and consider the four graphs we get by taking 

its strong product and coproduct with K2 and K2: 



^{G,w) ='&{G',w')^{G",w") 



(21.3) 



u 



V 



w 



K2* Pz = 



X 



y 



z 



(Since P3 may be regarded as K2 + Ki and K2 is Ki + i^2, this graph is 

((Ki + Ki) + Ki) + ((Ki + Ki) + Ki) 
and the formula for ■§ follows from (18.2) and (19.2).) 



K2* P3 = 



U V w 

X y z 
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(This graph is + -f^4; we could also obtain it by applying Lemma 16 three times to P3.) 



K2 * P3 



U l' w 




X y 




If the weights satisfy u = Xx, v = Xy, w = Xz for some parameter A, the first two formulas 
for both reduce to {1 + X) max{u + w,v), in agreement with (20.5) and (21.3). Similarly, 
the last two formulas for -d reduce to max(l, A) max(t( + w, v) in such a case. 

22. Odd cycles. Now let G = C„ be the graph with vertices 0, 1, . . . , n — 1 and 

u — V <^=^> tt — V = ±1 (mod n) , (22.1) 

where n is an odd number. A general formula for d{Cn,w) appears to be very difficult; 
but we can compute i?(C„) without too much labor when all weights are 1, because of the 
cyclic symmetry. 

It is easier to construct orthogonal labelings of than of C^, so we begin with that. 
Given a vertex w , < f < n, let 6^ be the three-dimensional vector 



a 

by = I cos v(p 

where a and (p remain to be determined. We have 

bu - by = o? + cos uip cos vip + sin u(p sin vip 
= + cos{u — v)ip . 

Therefore we can make b^ ■ by = when u = v ±1 by setting 

7r{n — 1) 



(22.2) 



a = — cos (fi . 



if = 



n 



(22.3) 



(22.4) 



This choice of (f makes nip a multiple of 27r, because n is odd. We have found an orthogonal 
labeling b of C„ such that 

.,2 



c{by) = 



a 



cos Tr/n 



1 + a"^ 1 + cos7r/n ' 



(22.5) 
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a. 



(22.6) 



Turning now to orthogonal labelings of Cm we can use (2n — 1) -dimensional vectors 

a2 cos 2t;(/7 
Q!2 sin 2v(p 

ctn-i cos(n — l)v(p 
\ an-1 sin(n — l)v(fi / 

with = 7r(n — l)/n as before. As in (22.3), we find 

n-l 



' O'v = ctfc cos k{u — v)(f ; 



(22.7) 



k=0 



SO the result depends only on {u — v) mod n. Let a; = e*'''. We can find values of such 
that ttu ■ ay = X(^u-v) mod n by solving the equations 



n-l 



(22.8) 



fc=0 



Now u; is a primitive nth root of unity; i.e., = 1 iS k is a multiple of n. So (22.9) is 
just a finite Fourier transform, and we can easily invert it: For < m < n we have 



n-l 



n—1 n—1 
fc=0 j=0 



In our case we want a solution with X2 = = • • • = x„-2 = 0, and we can set a^o = 1) 
Xn-i — xi = X, so we find 



n af, = xq + oj ^xi + uj'^Xn-t = 1 + 2a; cos k(f . 

We must choose x so that these values are nonnegative; this means 2x < — 1/ cos<^, since 
cos k(p is most negative when k = 1. Setting x to this maximum value yields 



c{ay) = al = - ( 1 



n 



1 + cos7r/n 
cos<^ / ncosTz/n 



So (22.5) and (22.9) give 



^ c{ay)c{by) = ^ 1/n = 1 . 



(22.9) 



(22.10) 
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This is (12.3), hence from (12.2) we know that 'd{Cn) = A. We have proved, in fact, that 

n cos n/n 



^(Cn,]l) = 



1 + COS 7r/n ' 

1 + cos 7r/n 
cos7r/n 



(22.11) 
(22.12) 



When n = 3, C„ = Kn and these values agree with '&{K^) = 1, '^{Ks) = 3; when n = 5, 
C5 is isomorphic to C5 so '^{C^) = ^/5^, when n is large, 

^(Cn) = ^ - ^ + 0(n-3) ; ^Cn) = 2 + 1^ + 0(n-^) . (22.13) 

Instead of an explicit construction of vectors ay as in (22.6), we could also find •d{Cn) 
by using the matrix characterization ^2 of section 6. When all weights are 1, a feasible 
A has 1 everywhere except on the superdiagonal, the subdiagonal, and the corners. This 
suggests that we look at "circulant" matrices; for example, when n = 5, 



A = 



( 1 

1 + a; 

1 
1 

Vl + a; 



l + x 
1 

\ + x 
1 
1 



1 

1 + a; 
1 

1 + X 
1 



1 
1 

1 + X 

1 

1 + a; 



1 
1 

1 + x 

1 / 



(22.14) 



where J is all I's and P is the permutation matrix taking j into (j + 1) mod n. It is well 
known and not difficult to prove that the eigenvalues of the circulant matrix a^I + aiP + 



• + an_iP" ^ are 



w^-'aj , < /c < n, 

0<j<n 



(22.15) 



where a; = e^'^'^l'^ , (indeed, it suffices to find the eigenvalues of P itself. This w is a 
different primitive root of unity from the a; we used in (22.8).) Hence the eigenvalues of 
(22.14) are 



n + 2a; , a;(a; + a; "*") , xiuP" -\- uj ^) , 



(22.16) 



We minimize the maximum of these values if we choose x so that 



n-\-1x = —2xcosTT/n ; 



then 



(22.17) 
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is the value of 'd{G). 

If n is even, the graph Cn is bipartite. We will prove later that bipartite graphs are 
perfect, hence '&{Cn) = n/2 and ^{Cn) = 2 in the even case. 

23. Comments on the previous example. The cycles Cn provide us with infinitely 
many graphs G for which d{G)"&{ G) = n, and it is natural to wonder whether this is true 
in general. Of course it is not: 1£ G — + Kn-m then G — + Kn-m, hence we know 
from Lemmas 18 and 19 that 

i&{G) = m+l, i&{G) = max{l,n-m) . (23.1) 

In particular, we can make ■d{G)'d{ G ) as high as + n/2 when m = [n/2\ . 
We can, however, prove without difficulty that ■d{G)'d{G) > n: 

Lemma. 

'd{G,w)^{G,w') >ww' . (23.2) 

Proof. By Theorem 12 there is an orthogonal labeling a of G and an orthogonal labeling b 
of G such that 

c{ay) = Wy/^{G, w) , c{by) = w'Jd{G, w') . (23.3) 

By (11.1) we have 

Y,<<^v)c{hy)<l. (23.4) 

V 

QED. □ 

24. Regular graphs. When each vertex of G has exactly r neighbors, Lovasz and 
Hoffman observed that the construction in (22.14) can be generalized. Let B be the 
adjacency matrix of G, i.e., the n x n matrix with 

f 1 , if tt — v; ^ ^ 

Buv = \'.^ , (24.1) 



Lemma. If G is a regular graph, 



Proof. Let ^ be a matrix analogous to (22.14), 



A = J + xB. 



(24.3) 
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Since G is regular, the all-l's vector 1 is an eigenvector of B, and the other eigenvectors 
are orthogonal to 1 so they are eigenvectors also of A. Thus if the eigenvalues of B are 

r = A{B) = Al > A2 > • ■ ■ > An = -A(-S) , (24.4) 

the eigenvalues of A are 

n + rx , xX2 , ■ ■ ■ , xXn . (24.5) 

(The Perron-Frobenius theorem tells us that Ai = r.) We have Ai H h A^ = tr{B) = 0, 

so A„ < 0, and we minimize the maximum of (24.5) by choosing n + rx = xX^, thus 

A(^) = xXr, = """^^ 



r - An 



which is the right-hand side of (24.2). By (6.3) and Theorem 12 this is an upper bound 
on 1}. □ 

Incidentally, we need to be a little careful in (24.2): The denominator can be zero, 
but only when G = Kn- 

25. Automorphisms. An automorphism of a graph G is a permutation p of the vertices 
such that 

piu) — piy) iff tt — V. (25.1) 

Such permutations are closed under multiplication, so they form a group. 

We call G vertex- symmetric if its automorphism group is vertex-transitive, i.e., if 
given u and v there is an automorphism p such that p{u) = v. We call G edge- symmetric 
if its automorphism group is edge-transitive, i.e., if given u — v and u' — v' there is an 
automorphism p such that p{u) — u' and p(v) — v' or p{u) — v' and p{y) — u' . 

Any vertex-symmetric graph is regular, but edge-symmetric graphs need not be reg- 
ular. For example, 

^^I^ is edge-symmetric, not vertex-symmetric; 

^^^^ is vertex-symmetric, not edge-symmetric. ( | is a maximal clique) 

The graph Cn is not edge-symmetric for n > 7 because it has more edges than automor- 
phisms. Also, Cr has no automorphism that takes — 2 into — 3. 

Lemma. If G is edge-symmetric and regular, equality holds in Lemma 24. 

Proof. Say that A is an optimum feasible matrix for G if it is a feasible matrix with 

A(^) = ^(G) 
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as in section 6. We can prove that optimum feasible matrices form a convex set, as follows. 
First, tA+ (1 — t)B is clearly feasible when A and B are feasible. Second, 

K{tA^{l-t)B) <tK{A)^{l-t)K{B) , 0<t<l (25.2) 

holds for all symmetric matrices A and B, by (6.2); this follows because there is a unit 
vector X such that K{tA + (1 - t)B) = x'^ {tA + (1 - t)B)x = tx'^Ax + (1 - t)x'^Bx < 
tA.{A) + (1 — t)A.{B). Third, if A and B are optimum feasible matrices, the right side of 
(25.2) is "&{G) while the left side is > "&{G) by (6.3). Therefore equality holds. 

If A is an optimum feasible matrix for G, so is p{A), the matrix obtained by permuting 
rows and columns by an automorphism p. (I mean p{A)uv = ^p(ii)p(«)-) Therefore the 
average. A, over all p is also an optimal feasible matrix. Since p{A) — A for all automor- 
phisms p, and since G is edge-symmetric, A has the form J + xB where B is the adjacency 
matrix of G. The bound in Lemma 24 is therefore tight. □ 

(Note: If p is a permutation, let Puv = 1 if u = p{v), otherwise 0. Then {P'^AP)uv = 

EPuj^jkPkv = SO p{A) = P^AP.) 

The argument in this proof shows that the set of all optimum feasible matrices A 
for G has a common eigenvector x such that Ax — '&{G)x. The argument also shows 
that, if G has an edge automorphism taking u — v into u' — v' , we can assume without 
loss of generality that Auv = Ay^iyi in an optimum feasible matrix. This simplifies the 
computation of A(^), and justifies our restriction to circulant matrices (22.14) in the case 
of cyclic graphs. 

Theorem. If G is vertex-symmetric, '&{G)'&{G) = n. 

Proof. Say that b is an optimum normalized labeling of G if it is a normalized orthogonal 
labeling of G achieving equality in (7.1) when all weights are 1: 

'd = ^^bu-by, = 1 , bu ■ by = when u — v. (25.3) 

U,V V 

Let B be the corresponding spud; i.e., B^v = ■ by and 'd = J2u v ^uv Then p{B) is also 
equivalent to an optimum normalized labeling, whenever p is an automorphism; and such 
matrices B form a convex set, so we can assume as in the lemma that B = p{B) for all 
automorphisms p. Since G is vertex-symmetric, we must have Byy = 1/n for all vertices v. 
Thus there is an optimum normalized labeling b with = 1/n, and the arguments of 

Lemma 10 and Theorem 12 establish the existence of such a b with 



c{by) = d{G)/n 



(25.4) 
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for all V. But b is an orthogonal labeling of G, hence 

MG, 1) < n/^{G) 

by the definition (5.2) of -di. Thus '&{G)i}{G) < n; we have already proved the reverse 
inequality in Lemma 23. □ 

26. Consequence for eigenvalues. A curious corollary of the results just proved is 
the following fact about eigenvalues. 

Corollary. If the graphs G and G are vertex-symmetric and edge-symmetric, and if the 
adjacency matrix of G has eigenvalues 

Al > A2 > ••• > An, (26.1) 

then 

(Al - Xn){n - Al + A2) = -An(A2 + l)n . (26.2) 
Proof. By Lemma 25 and Theorem 25, 

A{B)+A{-B) A(S)+A(-S) ~ ' ^ ^ ^ 

where B and B are the adjacency matrices of G and G, and where we interpret 0/0 as 1. 
We have 

B = J - I - B . (26.4) 
If the eigenvalues of B are given by (26.1), the eigenvalues of B are therefore 

n-l-Ai > -1-An > ••• > -I-A2. (26.5) 

(We use the fact that G is regular of degree Ai.) Formula (26.2) follows if we plug the 
values A{B) = Ai, A(-B) = -A„, A( S) = n - 1 - Ai, A{-B) = 1 + A2 into (26.3). □ 

27. Further examples of symmetric graphs. Consider the graph P{m, t, q) whose 
vertices are all subsets of cardinality t of some given set S of cardinality m, where 

u — V <(=^ \ur\v\=q. (27.1) 

We want < q < t and m > 2t — q, so that the graph isn't empty. In fact, we can assume 
that m > 2t, because P(m, r, q) is isomorphic to P{m, m — t,m — 2t + q) if we map each 
subset u into the set difference S \u: 

\{S\u)r](S\ v)\ = \S\ -\uUv\ = \S\ - \u\ - \v\ + \ur]v\. (27.2) 
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The letter P stands for Petersen, because P(5, 2, 0) is the well known "Petersen graph" on 
10 vertices, 




15 23 (27.3) 

These graphs are clearly vertex-symmetric and edge-symmetric, because every permutation 
of S induces an automorphism. For example, to find an automorphism that maps u — v 
into u' — f let u = {u\^v)Uu^ V = {u\^v)U v, u' = {u' fl v') U u\ v' = {u' fl v') U v' ^ and 
apply any permutation that takes the q elements of -u fit' into the q elements of u' Hv' , the 
t — q elements of u into the t — q elements of u', and the t — q elements of v into v' . Thus 
we can determine 'd{P{m, t, g)) from the eigenvalues of the adjacency matrix. Lovasz [12] 
discusses the case g = 0, and his discussion readily generalizes to other values of q. It 
turns out that '!?(P(m, t, 0)) = i^Zl)- This is also the value of Q!(P(m, t, 0)), because the 
i^Zl) vertices containing any given point form a stable set. 

The special case t = 2, g = is especially interesting because those graphs also satisfy 
the condition of Corollary 26. We have 



n = 



Ai = 



m — 2 
2 



A2 = l, 



An — 3 



m , 



(27.4) 



and (26.2) does indeed hold (but not "trivially"). It is possible to cover P(m, 2, 0) with 
disjoint maximum cliques; hence K(P(m, 2,0)) = C^) /[-yj = 2[^'| — 1. In particular, 
when G is the Petersen graph we have a{G) = ■i?(G) = 4, k,{G) = 5; also a{G) = 2, 
^(G) = k{G) = f . 

28. A bound on The paper [12] contains one more result about ■& that is not in [7], 
so we will wrap up our discussion of [12] by describing [12, Theorem 11]. 

Theorem. If G has an orthogonal labehng of dimension d with no zero vectors, we have 
^{G) < d. 

Proof. Given a non-zero orthogonal labeling a of dimension d, we can assume that 
||a„|p = 1 for all v. (The hypothesis about zeros is important, since there is trivially an 
orthogonal labeling of any desired dimension if we allow zeros. The labeling needn't be 



The Electronic Journal of Combinatorics 1 (1994), # Al 32 

optimum.) Then we construct an orthogonal labehng a" of dimension d^, with c{a'l) = l/d 
for all V, as follows: 

Let a'y have components where the (j, k) component is ajya^y. Then 

«L • = • o-vf (28.1) 

as in (20.7). Let Q be any orthogonal matrix with (f' rows and columns, such that the 
{j, k) entry in row (1, 1) is l/sfd for j = fc, otherwise. Then we define 

a'l = Qa', . (28.2) 

Once again a'^-a'l — {au-ay)^, so a" is an orthogonal labeling. We also have first component 

hence c(a^') — 1/d. This proves '&{G) < d, by definition of □ 

This theorem improves the obvious lower bound a{G) on the dimension of an optimum 
labeling. 

29. Compatible matrices. There's another way to formulate the theory we've been 
developing, by looking at things from a somewhat higher level, following ideas developed 
by Lovasz and Schrijver [15] a few years after the book [7] was written. Let us say that 
the matrix A is X-compatible with G and if ^ is an (n + 1) x (n + 1) spud indexed by 
the vertices of G and by a special value 0, having the following properties: 

• Aqq = A; 

= Wy for all vertices v; 

• Auy — whenever u -/- v in G. 

Lemma. There exists an orthogonal labehng a for G with costs c{ay) = Wy/X if and only 
if there exists a matrix A that is X-compatible with G and w. 

Proof. Given such an orthogonal labeling, we can normalize each vector so that = 
Wy. Then when Wy we have 

2 

Wy , . a-^ 
— = c{ay) = , 

A Wy 

SO we can assume that aiy = Wy/y/X for all v. Add a new vector ao, having aio = -\/A 
and ttjo = for all j > 1. Then the matrix A with A^v = o-u ' o-v is easily seen to be 
A-compatible with G and w. 
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Conversely, if such a matrix A exists, there are n + 1 vectors ao, . . . , such that 
Auv = du ■ O'v] in particular, ||ao|p = A. Let Q be an orthogonal matrix with first row 
Qq/^/X, and define a'^ = Qa^ for all v. Then a'lQ = -\/A and a^Q = for all j > 1. 
Also a'^ ■ a'^ = ttu ■ tty = Ay,y for all u and v. Hence y/Xa'i^ = g'q ■ a'^ = Aqv = and 
||a^||^ = ■ = A^y = Wy, for all f e G, proving that c(a^) = Wy/X. Finally a' is an 
orthogonal labeling, since a'^ ■ a'^ = Auy = whenever u -/- v. □ 

Corollary, x e TH(G) iff there exists a matrix 1-compatible with G and x. 

Proof. Set A = 1 in the lemma and apply Theorem 14. □ 

The corollary and definition (4.1) tell us that '&{G,w) is ma,x{wixi + ■■■ + WnXn) 
over all x that appear in matrices that are 1-compatible for G and x. Theorem 12 tells 
us that w) is also the minimum A such that there exists a A-compatible matrix for 
G and w. The "certificate" property of Theorem 13 has an even stronger formulation in 
matrix terms: 

Theorem. Given a nonnegative weight vector w = (tfi, . . . , WnY' , let A be X-compatible 
with G and w, where X is as small as possible, and let B be 1-compatible with G and x, 
where wixi + • • • + WnXn is as large as possible. Then 

ADB = 0, (29.1) 

where D is the diagonal matrix with Dqq = — 1 and Dyy = +1 for all v ^ 0. Conversely, 
if A is X-compatible with G and w and if B is 1-compatible with G and x, then (29.1) 
implies that X = wiXi + ■ ■ ■ + Wn^n — '^{G, w). 

Proof. Assume that A is A-compatible with G and tu, and B is 1-compatible with G 
and X. Let B' = DBD, so that B' is a spud with Bqq = 1, B'q^ = B'^q = —Xy, and 
B'^y = By^y when u and v are nonzero. Then the dot product A • B' is 

X — WiXi WnXn — WiXi WnXn -\-WiXi-\ h WnXn = A - {wiXi H h WnXn) , 

because A^yBuy = when u and v are vertices of G. We showed in the proof of F9 
in section 9 that the dot product of spuds is nonnegative; in fact, that proof implies 
that the dot product is zero if and only if the ordinary matrix product is zero. So A = 
wiXi + ■ ■ ■ + WnXn — '&{G, w) iff AB' — 0, and this is equivalent to (29.1). □ 

Equation (29.1) gives us further information about the orthogonal labelings a and b 
that appear in Theorems 12 and 13. Normalize those labelings so that ||a|p = Wy and 
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= Xy. Then we have 

J2'^tibfK)^i^Xy, (29.2) 
teG 

xt {at ■ a^) = Wy , (29.3) 

teG 

^{at ■ a„) {bt - by) = WuXy , (29.4) 

for all vertices u and v of G. (Indeed, (29.2) and (29.4) are respectively equivalent to 
{AB')oy = and {AB')yy = 0; (29.3) is equivalent to {B'A)oy = 0.) Notice that if A and 
B are the n x n spuds; obtained by deleting row and column from optimum matrices 
A and B, these equations are equivalent to 

Bw = ^x, Ax = w, AB = wx'^. (29.5) 

Equation (29.1) is equivalent to (29.5) together with the condition wiXi + ■ — \-WnXn = 

Since AB' = iff B'A = when A and B' are symmetric matrices, the optimum 
matrices A and B' commute. This implies that they have common eigenvectors: There is 
an orthogonal matrix Q such that 

A = Q diag (Ao, . . . , A„) , B' = Q diag (//q, ...,fin)Q'^- (29.6) 

Moreover, the product is zero, so 

AoAto = • • • = Xnl^n = . (29.7) 

The number of zero eigenvalues is n + 1 — d, where d is the smallest dimension for which 
there is an orthogonal labeling a with Auy — au-ay. A similar statement holds for B' , since 
the eigenvalues of B and B' are the same; y is an eigenvector for B iff Dy is an eigenvector 
for B'. In the case G = Cn, studied in section 22, we constructed an orthogonal labeling 
(22.3) with only three dimensions, so all but 3 of the eigenvalues fik were zero. When all 
the weights Wy are nonzero and '(^(G) is large. Theorem 28 implies that a large number of 
Afc must be nonzero, hence a large number of /ik must be zero. 

The "optimum feasible matrices" A studied in section 6 are related to the matrices A 
of (29.5) by the formula 

•dA = ww^ — 1? diag {wi , . . . , Wn) — diag {^/wi, . . . , ^/w^ ) A diag {-s/wi, . . . , ^/w^ ) , (29.8) 

because of the construction following (6.6). If the largest eigenvalue A(^) = of A occurs 
with multiplicity r, the rank of 'dl — A will be n — r, hence A will have rank n — r or 
n — r + 1, and the number of zero eigenvalues A^ in (29.6) will be r + 1 or r. 
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30. Antiblockers. The convex sets STAB, TH, and QSTAB defined in section 2 have 
many special properties. For example, they are always nonempty, closed, convex, and 
nonnegative; they also satisfy the condition 



A set X of vectors satisfying all five of these properties is called a convex corner. 
If X is any set of nonnegative vectors we define its antiblocker by the condition 



Clearly ablX is a convex corner, and ablX D ablX' when X C. X'. 
Lemma. If X is a convex corner we have ablablX = X. 

Proof. (Compare with the proof of F5 in section 8.) The relation X C ablablX is 
obvious by definition (30.2), so the lemma can fail only if there is some z e ablablX with 
z ^ X. Then there is a hyperplane separating z from X, by Fl; i.e., there is a vector y 
and a number b such that a; • j/ < 6 for all a; e X but z - y > b. Let y' be the same as y but 
with all negative components changed to zero. Then (y', b) is also a separating hyperplane. 
[Proof: If a; e X, let x' be the same as x but with all components changed to zero where 
y has a negative entry; then x' G X, and x ■ y' ~ x' • y < b. Furthermore z ■ y' > z ■ y > b.] 
If 6 = 0, we have Xy' G ablX for all A > 0; this contradicts z ■ Xy' < 1. We cannot have 
6 < 0, since G X. Hence b > 0, and the vector y'/b G ablX. But then z ■ (y'/b) must be 
< 1, a contradiction. □ 

Corollary. If G is any graph we have 



< y < X and x G X =^ y G X . 



(30.1) 



ablX = { y > I a; • y < 1 for all a; G X } . 



(30.2) 



STAB(G') = 

TH(G) 
QSTAB (G) 



abl QSTAB (G) , 



abl TH(G) , 
abl STAB(G) . 



(30.3) 



(30.4) 
(30.5) 



Proof. First we show that 



abl X = abl convex hull X. 



(30.6) 



The left side surely contains the right. And any element y G ablX will satisfy 



{aix^^^ + ■ ■ ■ + akx^''^) - y < 1 



when the a's are nonnegative scalars summing to 1 and the x^^^ are in X. This proves 
(30.6), because the convex hull of X is the set of all such aia;^^^ + h akX^'^\ 
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Now (30.6) implies (30.5), because the definitions in section 2 say that 



QSTAB(G') 



abl{ X I X is 3j chque labeling of G } 



STAB(G) 



abl { a; I a; is a stable labeling of G } , 
convex hull { stable labeling of G} . 



And (30.5) is equivalent to (30.3) by the lemma, because STAB(G) is a convex corner. (We 
must prove (30.1), and it suffices to do this when y equals x in all but one component; and 
in fact by convexity we may assume that y is in that component; and then we can easily 
prove it, because any subset of a stable set is stable.) 

Finally, (30.4) is equivalent to Theorem 14, because TH(G) = abl{x | — c(a^) for 
some orthogonal labeling of G } . □ 

The sets STAB and QSTAB are polytopes, i.e., they are bounded and can be defined 
by a finite number of inequalities. But the antiblocker concept applies also to sets with 
curved boundaries. For example, let 



be the intersection of the unit ball and the nonnegative orthant. Cauchy's inequality 
implies that x ■ y < 1 whenever ||x|| < 1 and ||y|| < 1, hence X C ablX. And if y € ablX 
we have y e X, since y ^0 implies = y ■ {yj \\y\\) < 1. Therefore X = ablX. 

In fact, the set X in (30.7) is the only set that equals its own antiblocker. \iY = ably 
and y e F we have y ■ y <1, hence Y C X; this implies ablF D X. 

31. Perfect graphs. Let uj{G) be the size of a largest clique in G. The graph G is 
called perfect if every induced subgraph G' of G can be colored with uj{G') colors. (See 
section 15 for the notion of induced subgraph. This definition of perfection was introduced 
by Claude Berge in 1961.) 

Let G'^ be G with vertex v duplicated, as described in section 16. This means we add 
a new vertex v' with the same neighbors as v and with v — v'. 

Lemma. If G is perfect, so is G^ . 

Proof. Any induced subgraph of G^ that is not G^ itself is either an induced subgraph 
of G (if it omits v or v' or both), or has the form G'^ for some induced subgraph G' of G 
(if it retains v and v'). Therefore it suffices to prove that G^ can be colored with a;(G+) 
colors. 

Color G with ui{G) colors and suppose v is red. Let G' be the subgraph induced 
from G by leaving out all red vertices except v. Recolor G' with uj{G') colors, and assign a 



X = {x>{) \ \\x\\ < 1} 



(30.7) 
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new color to the set G'^\G', which is stable in G"*". This colors G"*" with oj{G') + 1 colors, 
hence uj{G+) < uj{G') + 1. 

We complete the proof by showing that oj{G'^) = oj{G') + 1. Let Q be a clique of size 
oj{G') in G'. 

Case 1. V e Q. Then Q U {v'} is a clique of G^ . 
Case 2. v ^Q. Then Q contains no red element. 
In both cases we can conclude that u){G^) > uj{G') + 1. □ 

Theorem. If G is perfect, STAB(G) = QSTAB(G). 

Proof. It suffices to prove that every x e QSTAB(G) with rational coordinates is a member 
of STAB(G), because STAB(G) is a closed set. 

Suppose X e QSTAB(G) and qx has integer coordinates. Let be the graph obtained 
from G by repeatedly duplicating vertices until each original vertex v of G has been replaced 
by a clique of size qxy. Call the vertices of that clique the clones of v. 

By definition of QSTAB(G), if Q is any clique of G we have 

y^^Xy <1 . 

veQ 

Every clique Q' of G~^ is contained in a clique of size J2veQ ^"-"^ some clique Q of G. 
(Including all clones of each element yields this possibly larger clique.) Thus uj{G^) < q, 
and the lemma tells us that G"*" can be colored with q colors because G"*" is perfect. 

For each color k, where 1 < k < q, let xi'^^ = 1 if some clone of v is colored k, otherwise 
xi''^ = 0. Then x^^^ is a stable labeling. Hence 

1 

- J^a;^'^^ e STAB(G). 
^ fc=i 

But every vertex of G""" is colored, so Ylk=i ~ l^v Sfc=i ^^'^^ = x. \Z\ 

32. A characterization of perfection. The converse of Theorem 31 is also true; but 
before we prove it we need another fact about convex polyhedra. 

Lemma. Suppose P is the set {x>0\x-z<l for all z E Z} = abl Z for some finite 
set Z and suppose y G ablP, i.e., y is a nonnegative vector such that x ■ y < 1 for all 
X e P. Then the set 

Q^{xeP\x-y = l} (32.1) 
is contained in the set {x \ x ■ z — 1} for some z E Z (unless Q and Z are both empty). 

Proof. This lemma is "geometrically obvious" — it says that every vertex, edge, etc., of 
a convex polyhedron is contained in some "facet" — but we ought also to prove it. The 
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proof is by induction on |Z|. If Z is empty, the result holds because P is the set of all 
nonnegative x, hence y must be and Q must be empty. 

Suppose z is an element of Z that does not satisfy the condition; i.e., there is an 
element x E P with x ■ y = 1 and x ■ z ^ 1. Then x ■ z < 1. Let Z' = Z \ {z} and 
P' = abl Z'. It follows that x' -y < 1 for all x' E P'. For ii x' -y > 1, a convex combination 
x" = ea; + (1 — e)x' will lie in P for sufficiently small e, but x" ■ y > 1. 

Therefore by induction, Q' = {x E P' \ x ■ y = 1} is contained in { a; | a; • 2;' = 1 } for 
some z' e Z', unless Q' is empty, when we can take z' = z. And Q C Q', since P C P'. □ 

Theorem. G is perfect if and only if STAB(G) = QSTAB(G). 

Proof. As in section 15, let G\U be the graph induced from G by restriction to vertices U. 
If X is a set of vectors indexed by V and if t/ CV,letX\U be the set of all vectors indexed 
by U that arise from the vectors of X when we suppress all components Xy with v ^ U. 
Then it is clear that 

QSTAB(G|C/) = QSTAB(G)|C/, (32.2) 

because every x e ClSTAB(G|t/) belongs to C)STAB(G) if we set = for i; ^ U, and every 
X e QSTAB(G) satisfies J2v&Q ^ ^ for every clique Q CU. Also 

STAB(G|C/) = STAB(G)|C/ , (32.3) 

because every stable labeling of G\U is a stable labeling of G if we extend it with zeros, 
and every stable labeling of G is stable for G\U if we ignore components not in U. 

Therefore STAB(G) = QSTAB(G) iff STAB(G') = QSTAB(G') for all induced graphs. By 
Theorem 31 we need only prove that STAB(G) = QSTAB(G) implies G can be colored with 
a;(G) colors. 

Suppose STAB(G) = QSTAB(G). Then by Corollary 30, 

STAB(G) = QSTAB(G) . (32.4) 

Let P — STAB(G), and let y = l/u){G). Then x ■ y < 1 whenever x is a clique labeling 
of G, i.e., whenever x is a stable labeling of G; so x ■ y < 1 for all x E P. Let Z be the 
set of all stable labelings of G, i.e., clique labelings of G. Then P = QSTAB(G) = ahlZ 
and Z is nonempty. So the lemma applies and it tells us that the set Q defined in (32.1) 
is contained in{x|x-^ = l} for some stable labeling z of G. Therefore every maximum 
clique labeling x satisfies x ■ 2; = 1; i.e., every clique of size w(G) intersects the stable set S 
corresponding to z. So uj{G') = oj{G) — 1, where 



G' = G\{V\S). 



(32.5) 
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By induction on \V\ we can color the vertices of G' with oj{G') colors, then we can use a 
new color for the vertices of S-^ this colors G with uj{G) colors. □ 

Lovasz states in [13] that he knows no polynomial time algorithm to test if G is perfect; 
but he conjectures ("guesses") that such an algorithm exists, because the results we are 
going to discuss next suggest that much more might be provable. 

33. Another definition of The following result generalizes Lemma 9.3.21 of [7]. 

Lemma. Let a and b be orthogonal labelings of G and G that satisfy the conditions of 
Theorem 12, normaUzed so that 

\Wv\\^\\bv\\^ = Wvc(by) , aiv>0, and feiv > , (33.1) 

for all V. Then 

\ 0, otherwise. 



E%A.=(f(^- ■fri:^^ (33.2) 



Proof. Let ao = (v^, 0, . . . , 0)"^ and bo = (-1, 0, . . . , 0)^. Then the (n + 1) x (n + 1) 
matrices A = a^a and B — b^b are spuds, and A - B — (In the special case Ha^lP = w^, 
= c{bv), matrix B is what we called B' in the proof of Theorem 29.) Therefore 
= ixA^B = tra^ab^b — irbaFab^ = {ab^) ■ {ab'^), and we have ab^ = 0. In other words 



O'jobko + (ijybkv = 



for all j and k. □ 



We now can show that •& has yet another definition, in some ways nicer than the one 
we considered in section 6. (Someday I should try to find a simpler way to derive all these 
facts.) Call the matrix B dual feasible for G and w if it is indexed by vertices and 

B is real and symmetric; 
Byy = Wy for all v & V; 

Buv = whenever u -/- v in G; (33.3) 

and define 

dQ{G,w)=msx{K{B) I 

B is positive semidefinite and dual feasible for G and w } . (33.4) 



(Compare with the analogous definitions in (6.1) and (6.3).) 
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Theorem. i9{G,w) = i&6{G,w). 

Proof. If B is positive semidefinite and dual feasible, and if A is any eigenvalue of B, 
we can write B = QDQ^ where Q is orthogonal and D is diagonal, with Dn = X. Let 
h = \fDQ'^] then h is an orthogonal labeling of G with = for all v. Furthermore 

c{hy) = bf^/wy = Xq'^i/wy, where (gii, • • ■ , Qni) is the first column of Q. Therefore 
^yC{by)wy = X^^q'^i — A, and we have A < d^{G,w) by (10.1). This proves that 

Conversely, let a and b be orthogonal labellings of G and G that satisfy the con- 
ditions of Theorem 12. Normalize them so that ||a„|p = c{by) and = Wy. Then 
aly = c{ay)c{by) = Wyc{by)/'d = b\y/'d. The lemma now implies that (fen, . . . , 6in)"^ is an 
eigenvector of 6^6, with eigenvalue d. This proves that d <dQ. □ 

Corollary. d{G) = 1 + max{ A{B)/A{-B) \ B is dual feasible for G and 0}. 

Proof. If B is dual feasible for G and 0, its eigenvalues are Ai > • • • > A^ where 
Al = A{B) and A^ = -A(-S). Then B' = I + B/A{-B) has eigenvalues 1 - Xi/Xn, • • • , 
1 — ^n/^n = 0. Consequently B' is positive semidefinite and dual feasible for G and 1, 
and 1 + A{B)/A{-B) = A{B') <MG)- 

Conversely, suppose B' is positive semidefinite and dual feasible for G and 1, with 
A{B') = d = d{G). Let B = B' -I. Then B is dual feasible for G and 0, and < 
A(— -B) < 1 since the sum of the eigenvalues of 5 is tr5 = 0. It follows that -d — 1 = 
A{B)<A{B)/A{-B). □ 

34. Facets of TH. We know that TH(G) is a convex corner set in n-dimensional space, 
so it is natural to ask whether it might have (n — l)-dimensional facets on its nontrivial 
boundary — for example, a straight line segment in two dimensions, or a region of a plane 
in three dimensions. This means it has n linearly independent vectors x^^^ such that 

J24'^c{ay) = l (34.1) 

V 

for some orthogonal labeling a of G. 

Theorem. If TH(G) contains linearly independent solutions x^^\ . . . , a;*^") of (34.1 ), then 
there is a maximal clique Q of G such that 

Proof. Theorem 14 tell us that every x^^^ e TH(G) has xi''^ = c{bi''^) for some orthogonal 
labeling of G. Set Wy = c{ay); then 'd{G,w) = 1, by Theorem 13. We can normalize the 



The Electronic Journal of Combinatorics 1 (1994), # Al 41 
labelings so that \\av\\^ = ai„ = Wy and ||fet;'^''|P = ^i^^ = xi''\ Hence, by Lemma 33, 



/1\ 





ei . (34.3) 



Let 

Q = {v\aiy^O} = {v\c{ay)^0} (34.4) 
and suppose Q has m elements. Then (34.3) is equivalent to the matrix equation 

Ax^''^ = ei (34.5) 

where A is a, d x m matrix and .t*^'^-' has m components xi''\ one for each v & Q. By 
hypothesis there are m linearly independent solutions to (34.5), because there are n linearly 
independent solutions to (34.3). But then there are m — 1 linearly independent solutions 
to Ax = 0, and it follows that A has rank 1: Every row of A must be a multiple of the top 
row (which is nonzero). And then (34.5) tells us that all rows but the top row are zero. 
We have proved that 

c(a„) ^ ^ c(a„) = 1 . (34.6) 

Therefore if u and v are elements of Q we have a„ • Ot, 7^ 0, hence u — v; Q is a clique. 

Moreover, Q is maximal. For if v ^ Q is adjacent to all elements of Q, there is a A; 
such that xi'^^ > 0. But the characteristic labeling of Q U {v} is an orthogonal labeling a' 
such that ^^xi^^c{a'J = 1 + xi^^ > 1, hence a;^'^) ^ TH(G'). □ 

Conversely, it is easy to see that the characteristic labeling of any maximal clique Q 
does have n linearly independent vectors satisfying (34.1), so it does define a facet. For 
each vertex u we let x^"^ = 1, and x^^ = for all v ^ u except for one vertex v E Q 
with V -/- u (when u ^ Q). Then x^'^^ is a stable labeling so it is in TH(G). The point 
of the theorem is that a constraint Xyc{av) < 1 of TH(G) that is not satisfied by all 
X e QSTAB(G') cannot correspond to a facet of TH(G). 



Corollary. 



TH(G) is a polytope <^ TH(G) = QSTAB(G) 

<^ TH(G') = STAB(G') <^ G is perfect. 



Proof. If TH(G) is a polytope it is defined by facets as in the theorem, which are nothing 
more than the constraints of CISTAB(G'); hence TH(G) = QSTAB(G'). Also the antiblocker of 
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a convex corner polytope is a polytope, so TH(G) is a polytope by (30.4); it must be equal 
to QSTAB(G). Taking antiblockers, we have TH(G') = STAB(G') by (30.3). The converses 
are easy since STAB and QSTAB are always polytopes. The connection to perfection is an 
immediate consequence of Theorem 32 and Lemma 2. □ 

We cannot strengthen the corollary to say that i?((j) = a{G) holds if and only if 
'&{G) = k{G)] the Petersen graph (section 27) is a counterexample. 

35. Orthogonal labelings in a perfect graph. A perfect graph has 

■i?(G, w) = a{G, w) = max{a; • w | a; is a stable labeling of G} , (35.1) 

and Theorem 12 tells us there exist orthogonal labelings of G and G such that (12.2) 
and (12.3) hold. But it isn't obvious what those labelings might be; the proof was not 
constructive. 

The problem is to find vectors a„ such that au ■ CLy = when u -/- v and such that 
(12.2) holds; then it is easy to satisfy (12.3) by simply letting 6 be a stable labeling where 
the maximum occurs in (35.1). 

The following general construction gives an orthogonal labeling (not necessarily op- 
timum) in any graph: Let g{Q) be a nonnegative number for every clique Q, chosen so 
that 

^ g{Q) = Wy , for all v . (35.2) 
Furthermore, for each clique Q, let 



V 



\ 0, otherwise. 



Then 



au-ay = ^ g{Q) , 

{u,v}CQ 



hence • a^, = when u -/- v. If we also let ago = \/ diQ) for all «oo = 0, we find 

ao • a„ = a„ • a„ = ^ g{Q) = Wy . 

We have constructed a matrix A that is A-compatible with G and w, in the sense of 
section 29, where 

X = ao-ao = J29iQ)- (35.4) 
Q 
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An orthogonal labeling with costs c{a[j) = Wy/X can now be found as in the proof of 
Lemma 29. 

The duality theorem of linear programming tells us that the minimum of (35.4) subject 
to the constraints (35.2) is equal to the maximum value ofw-x over all x with X^veg < ^ 
for all Q. When x maximizes w ■ x, we can assume that a; > 0, because a negative Xy can 
be replaced by without decreasing w ■ x or violating a constraint. (Every subset of a 
clique is a clique.) Thus, we are maximizing w ■ x over QSTAB(G); the construction in the 
previous paragraph allows us to reduce A as low as k{G,w). But k{G,w) = 'd{G,w) in a 
perfect graph, so this construction solves our problem, once we have computed g{Q). 

The special case of a bipartite graph is especially interesting, because its cliques have 
only one or two vertices. Suppose all edges of G have the form u — v where u E U and 
V & V, and consider the network defined as follows: There is a special source vertex s 
connected to all ti G by a directed arc of capacity w^, and a special sink vertex t 
connected from all G by a directed arc of capacity Wy . The edges u — v of G are also 
present, directed from u to v with infinite capacity. Any flow from s to t in this network 
defines a suitable function g, if we let 

g{{u, v}) = the flow in it — > v , 

g{{u}) = Wu minus the flow in s — > tt , 
g{{v}) = Wy minus the flow in v — > t , 

for all u e U and v E V. Let S he a subset of U UV. If we cut the edges that connect s 
or t with vertices not in S, we cut off all paths from s to t if and only if 5 is a stable 
set. The minimum cut (i.e., the minimum sum of capacities of cut edges) is equal to the 
maximum flow; and it is also equal to 

Wy + Wy — maxjtt; • a; | a; is a stable labeling} = Wy + Wy — a{G, w) . 

ueu vev ueu vev 

Thus the value of A = g{Q) is X^uet/ '^u — {flow from s} + X^^^y Wy — {flow to t} + 
{flow m u V arcs} — a{G, w) — '&{G, w) as desired. 

For general perfect graphs G, a solution to (35.4) with A = i}{G, w) can be found in 
polynomial time as shown in equation (9.4.6) of [7]. However, the methods described in 
[7] are not efficient enough for practical calculation, even on small graphs. 

36. The smallest non-perfect graph. The cyclic graph C5 is of particular interest 
because it is the smallest graph that isn't perfect, and the smallest case where the function 
'&{G, w) is not completely known. 

The discussion following Theorem 34 points out that TH(G) always has facets in com- 
mon with QSTAB(G), when those facets belong also to STAB(G). It is not hard to see that 



The Electronic Journal of Combinatorics 1 (1994), # Al 



44 



QSTAB(C5) has ten facets, defined by Xj = and Xj + Xj mod 5 = 1 for < j < 5; and 
STAB(C5) has an additional facet defined by a;o + iCi + a;2 + 0:3 + 0:4 = 2. The weighted 
functions a and n of section 4 are evaluated by considering the vertices of STAB and QSTAB: 



q;(C5, {m;o, ■ • ■ , "^4}^) = max('U)o + W2,Wi + Ws, W2 + W4, + wq, W4 + Wi) ; (36.1) 



k{C5, {wq, w^}^) = max(a(C5, {wq, w^}^), {wq -\ h W4)/2) 



(36.2) 



Where these functions agree, they tell us also the value of 

For example, let f{x) = '(^(Cs, {a;, 1, 1, 1, 1, 1}^). Relations (36.1) and (36.2) imply 
that f{x) = a; + 1 when x > 2. Clearly /(O) = 2, and section 22 tells us that /(I) = -\/5. 
Other values of f{x) are not yet known. Equation (23.2) gives the lower bound f{x)'^ > 
x'^ + 4. Incidentally, the a vectors 





\/x\ 




1 












v 


/ 



1 

Vx + 1 
,/{x-2){x + l) 

V / 



yx 




{ V 

—yx 



















^/x 

1 



-V^+1 



\^{x-2){x^-l)J 



and h = (1) (0) (0) (0) (0) estabUsh f{x) for x > 2 in the fashion of Theorems 12 and 13. 

Let (p ~ {I + \/5 )/2 be the golden ratio. The matrices A and B' of Theorem 29, when 
G = C5 and w = 1, are 



/V5 
1 
1 
1 
1 

V 1 



1 
1 







1 

0-1 
1 

0-1 





1 


- 1 
1 

- 1 




1 



- 1 
1 

- 1 



1 \ 

0-1 




0-1 
1 / 







-1 


-1 


-1 


-1 


-1 \ 




-1 


1 





0-1 


0-1 





1 


-1 





1 





0-1 


0-1 




-1 


0-1 





1 





0-1 




-1 


</>-i 


0-1 





1 







V-i 





<i)-i 


(/.-I 





1 / 



They have the common eigenvectors 

/ \ 
1 



1 
1 
1 
1 

V 1 y 



-1 
-1 
-1 
-1 
V-i/ 



-1 
V / 



/ \ 




1 

-1 
V-0/ 



/ \ 

1 



-1 
V / 



/ \ 


1 



V-i/ 
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with respective eigenvalues (Aq, ...jAs) = (2-\/5, 0, -\/5/0, -\/5/0, 0, 0) and (//Q) • • • > A*5) = 
(0,2,0,0,1/0,1/0). (Cf. (29.6) and (29.7).) 

37. Perplexing questions. The book [7] explains how to compute ^?(G, w) with given 
tolerance e, in polynomial time using an ellipsoid method, but that method is too slow 
and numerically unstable to deal with graphs that have more than 10 or so vertices. For- 
tunately, however, new "interior-point methods" have been developed for this purpose, 
especially by Alizadeh [1,2], who has computed 'J?(G) when G has hundreds of vertices 
and thousands of edges. He has also shown how to find large stable sets, as a byproduct 
of evaluating 'd{G, w) when w has integer coordinates. Calculations on somewhat smaller 
cyclically symmetric graphs have also been reported by Overton [16]. Further computa- 
tional experience with such programs should prove to be very interesting. 

Solutions to the following four concrete problems may also help shed light on the 
subject: 

PI. Describe TH(C5) geometrically. This upright set is isomorphic to its own anti- 
blocker. (Namely, if (xq, xi, X2, x^, X4) e TH(C5), then so are its cyclic permutations 
{xi,X2,X3,X4,xo), etc., as well as the cyclic permutations of {xo,X4,X3,X2,xi); TH(C5) 
contains the cyclic permutations of (a;o, a;2) a^4) a^i) a^a) and {xo,xs,xi,X4^,X2)-) Can the 
values f{x) = '&{C^, {x, 1, 1, 1, 1}^), discussed in section 36, be expressed in closed form 
when < a; < 2, using familiar functions? 

P2. What is the probable value of '&{G, w) when G is a random graph on n vertices, 
where each of the (2) possible edges is independently present with some fixed probability pi 
(Juhasz [9] has solved this problem in the case w = 1, showing that 'd{G)/^/ (1 — p)n/p 
lies between | and 2 with probability approaching 1 as n — > 00.) 

P3. What is the minimum d for which G almost surely has an orthogonal labeling 
of dimension d with no zero vectors, when G is a random graph as in Problem P2? (Theo- 
rem 28 and the theorem of Juhasz [9] show that d must be at least of order \/n. But Lovasz 
tells me that he suspects the correct answer is near n. Theorem 29 and its consequences 
might be helpful here.) 

P4. Is there a constant c such that '&{G) < c\/na{G) for all n- vertex graphs Gl 
(This conjecture was suggested by Lovasz in a recent letter. He knows no infinite family 
of graphs where 'd{G)/a{G) grows faster than 0{-\/ri/ log n). The latter behavior occurs 
for random graphs, which have a{G) — logi/^n with high probability [4, Chapter XI].) 

Another, more general, question is to ask whether it is feasible to study two- or three- 
dimensional projections of TH(G), and whether they have combinatorial significance. The 
function 'd{G,w) gives just a one-dimensional glimpse. 

Lovasz and Schrijver have recently generalized the topics treated here to a wide variety 
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of more powerful techniques for studying 0-1 vectors associated with graphs [15]. In 
particular, one of their methods can be described as follows: Let us say that a strong 
orthogonal labeling is a vector labeling such that ||a^,|p = c(a„) and • ay > 0, also 
satisfying the relation 

c{au) + c{ay) + c{a^u) — I < au ■ a^ + a^ ■ a^, < c{av) (37.1) 

whenever u -/- w. In particular, when w = v this relation implies that au ■ ay = 0, so the 
labeling is orthogonal in the former sense. 

Notice that every stable labeling is a strong orthogonal labeling of G. Let S he a 
stable set and let u and w be vertices such that u — w. If u and w are not in S, condition 
(37.1) just says that < c(a„) < 1, which surely holds. If u is in S, then w ^ S and (37.1) 
reduces to c{ay) < c(a„) < c(a„); this holds even more surely. 

Let 

TH_(G) = {x \ Xy = c{by) for some strong orthogonal labeling of G}. (37.2) 
(This set is called N+{FR{G)) in [15].) We also define 

(G, w) = max{ wx\x e TH_ (G) } . (37.3) 
The argument in the two previous paragraphs implies that 

STAB(G) C TH_(G) C TH(G) , 

hence 

a{G,w) <^-{G,w) <i}{G,w). (37.4) 

The authors of [15] prove that 'i?_(G, w) can be computed in polynomial time, about as 
easily as i}{G,w)] moreover, it can be a significantly better approximation to a{G,w). 
They show, for example, that TH_(G) = STAB(G) when G is any cyclic graph G^. In fact, 
they prove that if x G TH_(G) and if — Vi, Vi — f2, . . . , V2n — "^o is any circuit 
or multicircuit of G, then Xy^ + Xy-^ + ■ • ■ + Xy^^ < n. This suggests additional research 
problems: 

P5. What is the smallest graph such that STAB(G) ^ TH_(G)? 

P6. What is the probable value of '&-{G) when G is a random graph as in Prob- 
lem P2? 

A recent theorem by Arora, Lund, Motwani, Sudan, and Szegedy [3] proves that there 
is an e > such that no polynomial algorithm can compute a number between a{G) and 
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n^a{G) for all n-vertex graphs G, unless P = NP. Therefore it would be surprising if 
the answer to P6 turns out to be that '(^-(G) is, say, O(logn)^ with probability — > 1 for 
random G. Still, this would not be inconsistent with [3], because the graphs for which 
a{G) is hard to approximate might be decidedly nonrandom. 

Lovasz has called my attention to papers by Kashin and Konyagin [10, 11], which 
prove (in a very disguised form, related to (6.2) and Theorem 33) that if G has no stable 
set with 3 elements we have 

^{G) < ■ (37.5) 

moreover, such graphs exist with 

^G) = 17(n^/7Vlogn) . (37.6) 

Further study of methods like those in [15] promises to be exciting indeed. Lovasz 
has sketched yet another approach in [14]. 
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